Qhub Blip Image Captioning Finetuned
Apache-2.0
A fine-tuned version of the BLIP model for the visual question-answering task on retail product images, fine-tuned on a custom dataset annotated with images and product descriptions from online retail platforms.
Image-to-Text
Transformers Supports Multiple Languages